K | # of bigrams | # of trigrams | # of 4-grams | # of 5-grams | # of 6-grams |
---|---|---|---|---|---|
100 | 78 | 88 | 99 | 99 | 99 |
1000 | 320 | 689 | 874 | 942 | 973 |
10000 | 883 | 3141 | 5790 | 7730 | 8852 |
100000 | 1718 | 6744 | 14558 | 22035 | 27392 |
1000000 | 1718 | 6744 | 14558 | 22035 | 27392 |
Both the problem and the results are much similar to the previous subsection: We consider letter-N-grams at the end of words instead of the beginning.
3.8.1 Number of letter-N-grams at word beginnings